Video display device and video display method

ABSTRACT

A video display device includes: a video receiver that obtains video data including a video and dynamic luminance characteristics indicating a time-dependent change in luminance characteristics of the video; a tone mapping processor that, in the case where a luminance region having a luminance less than or equal to a first luminance is defined as a low luminance region, and a luminance region having a luminance exceeding the first luminance is defined as a high luminance region, (i) performs first tone mapping using first conversion characteristics when first luminance characteristics exceed a predetermined threshold value, and (ii) performs second tone mapping using second conversion characteristics when the first luminance characteristics are less than or equal to the predetermined threshold value.

TECHNICAL FIELD

The present disclosure relates to a video display device and a videodisplay method that process a video signal.

BACKGROUND ART

Patent Literature (PTL) 1 describes an HDR (high dynamic range) displaydevice that updates a display method for an HDR signal according todynamic HDR metadata.

CITATION LIST Patent Literature

PTL 1: Japanese Unexamined Patent Application Publication No.2017-184249

Non Patent Literature

NPL 1: White Paper Blu-ray Disc Read-Only Format (Ultra HD Blu-ray),Audio Visual Application Format Specifications for BD-ROM Version 3.1,August 2016(http://www.blu-raydisc.com/Assets/Downloadablefile/BD-ROM_Part3_V3.1_WhitePaper_160729_clean.pdf)

SUMMARY OF THE INVENTION Technical Problem

The present disclosure provides a video display device and a videodisplay method that can improve the quality of a video displayed.

Solution to Problem

A video display device according to one aspect of the present disclosureincludes: an obtainer that obtains video data including a video anddynamic luminance characteristics indicating a time-dependent change inluminance characteristics of the video; a tone mapping processor that,in the case where a luminance region having a luminance less than orequal to a first luminance is defined as a low luminance region, and aluminance region having a luminance exceeding the first luminance isdefined as a high luminance region, (i) performs first tone mappingusing first conversion characteristics when first luminancecharacteristics exceed a predetermined threshold value, and (ii)performs second tone mapping using second conversion characteristicswhen the first luminance characteristics are less than or equal to thepredetermined threshold value, the first luminance characteristics beingincluded in the dynamic luminance characteristics and indicating thenumber of pixels having luminances less than or equal to a secondluminance among pixels included in the low luminance region in one frameof the video, the first tone mapping maintaining the luminances lessthan or equal to the second luminance, the second tone mappingdecreasing the luminances less than or equal to the second luminance;and a display that displays a video obtained as a result of the firsttone mapping or the second tone mapping.

Advantageous Effect of Invention

The present disclosure can provide a video display device and a videodisplay method that can improve the quality of a video displayed.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram for illustrating the evolution of video technology.

FIG. 2 is a diagram for illustrating a relationship among videoproduction, delivery methods, and display devices when a new videorepresentation is introduced into content.

FIG. 3A is a graph showing an example of a tone map.

FIG. 3B is a graph showing an example of a tone map.

FIG. 4A is a graph showing an example of a static tone map.

FIG. 4B is a diagram showing an example of a dynamic tone map.

FIG. 5A is a graph showing an example of an EOTF (electro-opticaltransfer function) corresponding to each of HDR and SDR.

FIG. 5B is a graph showing an example of an inverse EOTF correspondingto each of HDR and SDR.

FIG. 6 is a graph showing a relationship between the luminance of avideo inputted and luminance outputted from an actual display.

FIG. 7 is a table showing an example of dynamic metadata.

FIG. 8 is a graph for illustrating a method of calculating 99Y andDY100.

FIG. 9 is a graph for illustrating a method of calculating 18G.

FIG. 10 is a block diagram illustrating an example of the configurationof a video display device according to an embodiment.

FIG. 11 is a block diagram illustrating an example of the configurationof an HDR signal converter according to the embodiment.

FIG. 12 is a flowchart showing the operations of the video displaydevice according to the present embodiment.

FIG. 13 is a graph for illustrating a method of creating conversioncharacteristics created when threshold value TH_ A<luminance compressionratio<1.

FIG. 14 is a graph for illustrating a method of creating conversioncharacteristics created when DY100<threshold value TH.

FIG. 15 is a flowchart for illustrating the first example of tonemapping.

FIG. 16 is a graph for illustrating a method of creating conversioncharacteristics in the second example.

FIG. 17 is a diagram for illustrating a method of creating conversioncharacteristics in the second example.

FIG. 18 is a diagram for illustrating conversion characteristics createdin the third example of tone mapping.

FIG. 19 is a graph showing an example of conversion characteristicscreated when both maxRGB Percentile [90] and maxRGB Percentile [98] arecloser to 18G than to 99Y.

FIG. 20 is a graph showing an example of conversion characteristicscreated when both maxRGB Percentile [90] and maxRGB Percentile [98] arecloser to 99Y than to 18G.

FIG. 21 is a flowchart for illustrating the fifth example of tonemapping.

FIG. 22 is a graph for illustrating conversion characteristics createdin the fifth example of tone mapping.

FIG. 23 is a block diagram illustrating an example of the configurationof a generation device according to the embodiment.

FIG. 24 is a block diagram illustrating an example of the configurationof a generator according to the embodiment.

FIG. 25 is a flowchart showing an example of a generation method.

FIG. 26 is a flowchart showing a step of determining luminancecharacteristics in the generation method.

DESCRIPTION OF EXEMPLARY EMBODIMENT(S) 1-1. Background

First, the evolution of video technology will be described withreference to FIG. 1. FIG. 1 is a diagram for illustrating the evolutionof video technology.

The video quality has so far been enhanced with a view to increasing thenumber of display pixels. Not only standard definition (SD) videoshaving a resolution of 720×480 pixels but also high definition (HD)videos having a resolution of 1920×1080 pixels are widely used.

In recent years, in order to further enhance the video quality, theintroduction of ultra high definition (UHD) videos having a resolutionof 3840×1920 pixels or 4K resolution of 4096×2048 pixels, that is, 4Kvideos has started.

Along with the introduction of 4K, the following has been considered:the expansion of the dynamic range, the expansion of the color gamut,the addition or improvement of the frame rate, etc.

Among these, regarding the dynamic range, HDR (high dynamic range) hasgained attraction as a technique of representing bright light, such asspecular reflection light that cannot be represented with the currenttelevision signals, at a brightness level closer to reality whilemaintaining a dark part tone. Specifically, conventional televisionsignals are called SDR (standard dynamic range) signals, and have themaximum luminance of 100 nit. In contrast, in HDR, the maximum luminanceis assumed to increase up to at least 1000 nit. The SMPTE (Society ofMotion Picture & Television Engineers), the ITU-R (InternationalTelecommunications Union Radiocommunication Sector), etc. are currentlystandardizing mastering display standards for HDR.

As with HD and UHD, examples of specific applications of HDR includebroadcasting, packaged media (e.g. Blu-ray (registered trademark) disc),and Internet delivery.

1-2. Relationship Among Creation of Master, Delivery Methods, andDisplay Devices

FIG. 2 is a diagram for illustrating a relationship among videoproduction, delivery methods, and display devices when a new videorepresentation is introduced into content.

When a new video representation (e.g. an increase in the number ofpixels) is introduced to enhance the video quality, as shown in FIG. 2,it is necessary to (1) change a master for home entertainment producedby a video producer. In response, it is also necessary not only to (2)update a delivery method, such as broadcasting, communication, and apackaged medium, but also to (3) update a display device that displaysthe video, such as a television or a projector.

1-3. Tone Mapping

Tone mapping is a process of limiting, according to a relationshipbetween the luminance of an HDR video and the maximum luminance (displaypeak luminance: DPL) of a video display device, the luminance of a videoto be less than or equal to DPL by converting the luminance of the videowhen the maximum luminance (maximum content luminance level: MaxCLL) ofthe video exceeds DPL. This process makes it possible to display thevideo without losing information about luminance in the vicinity of themaximum luminance of the video. Since this conversion depends not onlyon the characteristics of video display devices but also on how videosare displayed, different conversion characteristics are used for eachvideo display device.

FIG. 3A and FIG. 3B each are a graph showing an example of a tone map.FIG. 3A shows a case in which DPL is 500 nit, and FIG. 3B shows a casein which DPL is 1000 nit. Moreover, FIG. 3A and FIG. 3B show a tone mapwhen a video having MaxCLL of 1000 nit is displayed, and a tone map whena video having MaxCLL of 4000 nit is displayed, respectively.

As shown in FIG. 3A, when DPL is 500 nit, the luminance of both videosis converted so that both videos can be displayed at a luminance up toMaxCLL but not greater than 500 nit. However, the degree of conversionis greater for one of the videos having higher MaxCLL.

As shown in FIG. 3B, when DPL is 1000 nit, tone mapping is not performedon the video having MaxCLL of 1000 nit. Tone mapping is performed on thevideo having MaxCLL of 4000 nit to convert the luminance from 4000 nitto 1000 nit, and the video is displayed at 1000 nit.

1-4. Dynamic Metadata and Dynamic Tone Map

FIG. 4A is a graph showing an example of a tone map in which staticmetadata is used. FIG. 4B is a diagram showing an example of a dynamictone map in which dynamic metadata is used.

As shown in FIG. 4A, when static metadata (MaxCLL) is used, since MaxCLLshows the maximum luminance in a series of videos, a video displaydevice can perform only static tone mapping on the series of videos. Incontrast, as shown in (a) in FIG. 4B, when metadata (here referred to asDynamic MaxCLL) suited to time-variable luminance is used, the videodisplay device does not perform tone mapping when the luminance is low((b) in FIG. 4B), and performs tone mapping when the luminance is high((c) in FIG. 4B). In this manner, the video display device can achieveoptimal tone mapping suited to the time-variable luminance. Dynamicmetadata is dynamic luminance characteristics indicating atime-dependent change in the luminance characteristics of a video. Theluminance characteristics of the video used as the dynamic metadata maybe the maximum luminance, the average luminance, etc. in eachpredetermined section of the video. In the present disclosure, theluminance characteristics of a video will be described as an example ofthe maximum luminance of the video. Examples of a predetermined sectionof a video include a scene, a chapter, and a frame.

1-5. EOTF

Here, EOTF will be described with reference to FIG. 5A and FIG. 5B.

FIG. 5A is a graph showing an example of an EOTF (electro-opticaltransfer function) corresponding to each of HDR and SDR.

EOTF is generally referred to as gamma curve, indicates correspondencebetween code values and luminance values, and is used to convert a codevalue into a luminance value. In other words, EOTF is relationshipinformation indicating a correspondence relationship between code valuesand luminance values.

FIG. 5B is a graph showing an example of an inverse EOTF correspondingto each of HDR and SDR.

Inverse EOTF indicates a correspondence between luminance values andcode values, and is used to quantize and convert a luminance value intoa code value inversely with EOTF. In other words, inverse EOTF isrelationship information indicating a correspondence relationshipbetween luminance values and code values. For example, when theluminance values of an HDR-compatible video are represented by codevalues having 10-bit tones, luminance values in an HDR luminance rangeup to 10000 nit are quantized and mapped to 1024 integer values rangingfrom 0 to 1023. In other words, the luminance values (the luminancevalues of the HDR-compatible video) in the luminance range from 0 to10000 nit are converted into HDR signals that are 10-bit code values bybeing quantized in accordance with the inverse EOTF. HDR-compatible EOTF(hereinafter referred to as “HDR EOTF”) or HDR-compatible inverse EOTF(hereinafter referred to as “HDR inverse EOTF”) makes it possible torepresent a luminance value higher than SDR-compatible EOTF (hereinafterreferred to as “SDR EOTF”) or SDR-compatible inverse EOTF (hereinafterreferred to as “SDR inverse EOTF”). For example, in FIG. 5A and FIG. 5B,the maximum value of luminance (peak luminance) is 10000 nit. In otherwords, the HDR luminance range encompasses an SDR luminance range, andan HDR peak luminance is greater than an SDR peak luminance. The HDRluminance range is a luminance range having the peak luminance that isincreased from 100 nit, which is the peak luminance of the SDR luminancerange, to 10000 nit.

Examples of the HDR EOTF and the HDR inverse EOTF include SMPTE 2084standardized by the Society of Motion Picture and Television Engineers(SMPTE).

1-6. Conventional Techniques

In the conventional techniques, metadata indicating single maximumluminance information is shown for whole content, and a display processis performed using a tone map configured for single content.Accordingly, conventional video display devices perform tone mappingthat adjusts the maximum luminance of content indicated by maximumluminance information to the display maximum luminance of the videodisplay devices such that, for example, even when a dark scene includesno high luminance information, the video display devices can display thescene at luminance up to high luminance.

This problem, however, can be solved by giving dynamic metadataindicating luminance information for each scene to the video displaydevices. In other words, the video display devices can perform optimaltone mapping for each scene using the dynamic metadata, therebyimproving luminance and tones.

1-6-1. Problem 1

As stated above, the problem of the conventional techniques can besolved by using the dynamic range of luminance for each scene.

However, merely using maximum luminance information as shown in FIG. 6does not lead to find an optimal way of specifically drawing conversioncharacteristics for use in tone map, and may cause loss of luminance ortones depending on content.

1-6-2. Solution to Problem 1

In view of the above, the present disclosure defines dynamic metadatafor optimizing conversion characteristics for use in tone map, anddescribes an algorithm for creating conversion characteristics for usein tone map in accordance with the dynamic metadata. By optimizing atone map using dynamic data or feature data of a video equivalent to thedynamic metadata, which is obtained by analyzing a main video, a videodisplay device can display a video that achieves an optimal tone mapaccording to a weight assigned to each luminance of the video.

The details will be described as an embodiment below.

It should be noted that FIG. 6 is a graph showing a relationship betweenthe luminance (Scene Luminance) of a video inputted and luminance(Display Luminance) outputted from an actual display. In other words,FIG. 6 shows conversion characteristics for use in tone map.

1-7. Dynamic Metadata

FIG. 7 is a table showing an example of dynamic metadata.

The dynamic metadata includes information shown in FIG. 7.

Specifically, the dynamic metadata includes 99Y, 18G, maxRGB Percentile(1%), maxRGB Percentile (25%), maxRGB Percentile (50%), maxRGBPercentile (75%), maxRGB Percentile (90%), maxRGB Percentile (95%),(99.98%), average max RGB, knee_point_x, knee_point_y, Bezier_anchor(0-9), and DY100. In particular, the dynamic metadata may include 99Yand DY100.

The dynamic metadata may be information indicating the luminancecharacteristics of each of frames included in a main video, andinformation indicating luminance characteristics for each scene perframe. It should be noted that when the dynamic metadata is theinformation indicating the luminance characteristics for each scene, thedynamic metadata may be the maximum value or average value of theluminance characteristics of the frames included in each scene.

The following illustrates, as an example, a case in which the dynamicmetadata is information indicating luminance characteristics for eachscene. FIG. 8 is a graph for illustrating a method of calculating 99Yand DY100. FIG. 9 is a graph for illustrating a method of calculating18G. FIG. 8 and FIG. 9 each are a graph showing the luminancedistribution of pixels in one frame.

As shown in FIG. 8, 99Y_(F) indicates the maximum luminance value in arange not exceeding 99.99% when all the pixels in one frame areaccumulated from a low luminance side of a luminance histogram showing arelationship between luminances and the number of pixels having theluminances. 99Y is set to the maximum value among 99Y_(F) of framesconstituting a scene. It should be noted that 99Y is an integer from 0to 4095 obtained by normalizing a luminance value (0 nit to 10000 nit)to 12 bit (0 to 4095).

18G is calculated under the following conditions.

-   -   For each of frames constituting one scene, a cumulative        histogram having intervals of 50 nit (50-nit cumulative        histogram) is constructed with a target luminance as a start        point, based on a luminance histogram of the frame.    -   A point at which a cumulative value is largest in the 50-nit        cumulative histogram (the maximum cumulative value) and the        luminance value thereof are calculated.    -   Luminance is increased from the luminance value, and a luminance        value that first falls below 10% of the maximum cumulative value        is determined as 18G.    -   18G when a luminance distribution from 0 nit to 18G exceeds 80%        of the entire luminance distribution is determined as a valid        value, and 18G when the luminance distribution does not is        determined as an invalid value.

In other words, 18G is derived as a luminance value inDistribution_Cutoff shown in FIG. 9. 18G is an average of 18G of framesconstituting a scene. When 18G is equal to 0, such 18G should not beincluded in the calculation of 18G.

It should be noted that Distribution_Cutoff is a threshold valueobtained by multiplying Cutoff_Threshold and Distribution_Peak.Distribution_Peak is a pixel count at the peak of the luminancedistribution of pixels in a frame. Cutoff_Threshold is, for example,0.10. In other words, Distribution_Cutoff is 10% of the pixel count ofDistribution_Peak.

maxRGB Percentile [k] is determined by the following calculation. Itshould be noted that k is any one of 1, 25, 50, 75, 90, 95, and 99.98.

-   -   A value obtained by EOTF converting each pixel RGB of a screen        is normalized.    -   A histogram for the maximum value (maxRGB) of the RGB values        normalized for each pixel is constructed.    -   A value when a pixel area is accumulated from 0 and a counted        pixel area reaches a value of k % is determined as maxRGB        Percentile [k].

average maxRGB is a value obtained by averaging maxRGB values for eachpixel in one frame by an entire screen.

knee_point_x and knee_point_y indicate the range of a linear portion ofconversion characteristics for use in tone map. In other words, theconversion characteristics are linear from (0, 0) to (knee_point_x,knee_point_y).

Bezier_anchor (0-9) indicates a Bezier coefficient for determining atone map in a region above knee_point_x, y.

DY100_(F) (DistributionY100nit) indicates a proportion of pixels havingluminances less than or equal to 100 nit to all the pixels in aluminance histogram for one frame. DY100F is not estimated from a“Distribution MaxRGB Percentile” value, and is the number of pixelsincluded in one frame and accumulated in a range of 0 to 100.23 [nit] tomore accurately find a distribution of pixels having low luminances. Itshould be noted that DY100F is derived as a percentile of pixelsincluded in a frame and having luminances less than or equal to 100.23nit (10 bit/[0:1023] and Y=520). DY100 is configured as an average valueof DY100F of frames constituting a scene. DY100 is an integer from 0 to100.

It should be noted that the luminance value (in nit or cd/m²) of a pixelfor deriving both 99Y and DY100 needs to be converted from Y by themethod specified in SMPTE ST. 2084. Further, as indicated by Equation 1below, Y′[0:1], normalized Y, needs to be converted from an R′G′B′ pixelvalue by the method specified in ITU-T BT. 2020.

Y′=0.2627R′+0.6780G′+0.0593B′  (Equation 1)

It should be noted that dynamic metadata is included in video data ascontent information, and a video reproduction device reproducing a videoadds the dynamic metadata to a main video for each scene when the videois reproduced, and transmits the dynamic metadata to a video displaydevice. The luminance characteristics of the main video corresponding tothe dynamic metadata can be also obtained by analyzing the main video ofcontent. In other words, the present disclosure includes tone mappingperformed based on luminance characteristics corresponding to dynamicmetadata that are obtained by the video display device analyzing themain video. The present disclosure illustrates, for example, as dynamicluminance characteristics indicating a time-dependent change in theluminance characteristics of a video, dynamic metadata and luminancecharacteristics obtained by analyzing a main video per frame or scenecomprising frames.

(Method of Generating 18G)

To determine KneePoint in a tone map, information about a luminancedistribution is needed. In particular, since kneepoint is configured bycalculating the degree of concentration in a luminance histogram forentire one frame, accurate values are needed.

A percentile value is an integrated value of pixels in one frame inincreasing order of luminance, and is discrete. Consequently, thepercentile value indicates an area distribution of a determinedluminance. For this reason, the percentile value does not sufficientlyshow a degree of luminance concentration. For example, when a degree ofluminance concentration is determined using maxRGB Percentile [75], itis impossible to distinguish between a case in which luminance isconcentrated up to 74%, and luminance away from the luminance at 74% areat 75%, and a case in which luminance is concentrated up to 75%.

For this reason, to show luminance concentration, a luminance value(Distribution_Peak_count_Luminance) indicating the maximum count and themaximum count (Distribution_Peak) are detected in a luminance histogramusing 18G that enables detection of a luminance distribution thecontinuity of which can be determined. Moreover, a luminance value isincreased from Distribution_Peak_count_Luminance, a luminance isidentified at which a count falls below 10% of Distribution_Peak, andthe identified luminance is referred to as 18G_measure.

The first problem in generating 18G is that in an operation based on asimple histogram, the count may momentarily be small, and a wrong valuemay result. In contrast, in the method of generating 18G, byconstructing a luminance histogram including values of integral atintervals of 50 nit, a sensitivity to a momentary change is reduced andan optimal value is obtained.

Moreover, the second problem in generating 18G is that, even when valuesare used which are calculated from the luminance histogram including thevalues of integral at intervals of 50 nit, a degree of concentration maybe small in the case of, for example, a luminance distribution that isaverage for an entire frame but in which the peak is high andconcentrated. In response, to exclude a case in which the degree ofconcentration is small, a condition that an integrated value (apercentile value) up to 18G in the luminance histogram exceeds 80% ismade for the obtained result. With this, an integrated value less than80% does not reach 18G, thereby excluding the case in which the degreeof concentration is small.

1-8. Configuration

Next, the following describes the configuration of video display device100 according to the present embodiment.

FIG. 10 is a block diagram illustrating an example of the configurationof the video display device according to the embodiment. FIG. 11 is ablock diagram illustrating an example of the configuration of an HDRsignal converter according to the embodiment.

Video display device 100 includes video receiver 110, tone mappingprocessor 120, and display 130.

Video receiver 110 receives video data including a main video, which isa video, and dynamic luminance characteristics. In other words, videoreceiver 110 serves as an obtainer that obtains video data. Videoreceiver 110 transmits the received video data to tone mapping processor120.

Tone mapping processor 120 performs tone mapping using predeterminedconversion characteristics. Tone mapping processor 120 includes HDRsignal converter 121 and tone map generation device 122.

HDR signal converter 121 optimizes the luminance information of the mainvideo, which is an HDR video, to the luminance of display 130, andoutputs the optimized luminance information.

Moreover, as shown in FIG. 11, HDR signal converter 121 includes inputsignal-luminance converter circuit 123 that converts a code valueindicating the luminance of the main video into luminance, andluminance-output level converter circuit 124 that converts luminanceinto a code value converted into the luminance of display 130.Luminance-output level converter circuit 124 performs tone mapping usingconversion characteristics created by tone map generation device 122,and causes display 130 to optimally display a video obtained as a resultof the tone mapping.

Tone map generation device 122 optimizes the conversion characteristicsused by HDR signal converter 121, according to the luminance of display130. Tone map generation device 122 obtains from video receiver 110 thedynamic luminance characteristics included in the video data, andcreates optimal conversion characteristics for use in tone map. Examples1 to 5 will separately illustrate a specific method of creatingconversion characteristics below.

Display 130 displays the video obtained as the result of the tonemapping.

1-9. Operations

FIG. 12 is a flowchart showing the operations of the video displaydevice according to the present embodiment.

In video display device 100, first, video receiver 110 obtains videodata including a main video, which is a video, and dynamic luminancecharacteristics indicating a time-dependent change in the luminancecharacteristics of the video (S1).

Next, tone mapping processor 120 performs tone mapping on the main videousing conversion characteristics most suitable for the dynamic luminancecharacteristics and the maximum display luminance that is the maximumluminance of a display device, according to the dynamic luminancecharacteristics and the maximum display luminance (S2).

Finally, display 130 displays a video obtained as a result of the tonemapping (S3).

1-10. Tone Mapping

Next, the following describes examples of tone mapping.

1-10-1. First Example of Tone Mapping

The first example of tone mapping will be described below.

The first example illustrates a case in which a tone map is generatedusing DY100 and 99Y as dynamic metadata.

First, tone map generation device 122 identifies the luminance dynamicrange of a main video using 99Y, and determines whether to perform (1)luminance compression or (2) no luminance compression on the identifiedluminance dynamic range. It should be noted that the luminancecompression is a process of decreasing the luminance of a main video toreduce the luminance dynamic range of a video that display 130 is causedto display. For example, the luminance compression is a process ofreducing a luminance dynamic range so that the maximum luminance of amain video becomes the maximum display luminance because, when themaximum luminance of the main video exceeds the maximum displayluminance, display 130 cannot be caused to display the main video at themaximum luminance. Moreover, when tone map generation device 122performs the luminance compression, tone map generation device 122 mayfurther determine whether to perform (1-1) the luminance compression onthe luminance dynamic range of the main video in a high luminance regionor (1-2) the luminance compression on the entire luminance dynamic rangeof the main video.

Furthermore, when tone map generation device 122 performs (1-2) theluminance compression on the entire luminance dynamic range, tone mapgeneration device 122 controls the luminance compression ratio of a darkpart according to DY100 so as to maintain the tones and viewability ofthe dark part.

For example, tone map generation device 122 calculates, from 99Y of thedynamic metadata and a display luminance (DPB: Display Peak Brightness)indicating the maximum display luminance, a luminance compression ratiousing Equation 2 below.

Luminance compression ratio=DPB/99Y   (Equation 2)

Tone map generation device 122 creates conversion characteristics thatvary according to the calculated luminance compression ratio. Forexample, as described in the following (1) to (3), tone map generationdevice 122 creates conversion characteristics that vary according to aluminance compression ratio.

(1) Luminance Compression Ratio≥1

In this case, tone map generation device 122 needs no tone mapping, andthus does not create conversion characteristics. In other words, in thiscase, tone map generation device 122 determines to perform no luminancecompression.

Accordingly, HDR signal converter 121 outputs the luminance of a videoin a range up to 99Y without converting the luminance. For this reason,Scene Luminance equals to Display Luminance.

(2) Threshold Value TH_A<Luminance Compression Ratio<1

Threshold value TH_A is a value obtained by multiplying DPB by apredetermined coefficient (e.g., a number greater than 0.5 and less than1). The predetermined coefficient is a value optimized based onexperience. In this case, since a luminance compression ratio is closeto 1 and luminance compression is small, conversion characteristics forthe luminance compression in a high luminance region are created. Inother words, in this case, tone map generation device 122 determines toperform the luminance compression in the high luminance region.

Specifically, as shown by “3” in FIG. 13, tone map generation device 122creates conversion characteristics including (i) a PQ curve that outputsoutput luminance (i.e. display luminance) that is identical to inputluminance (i.e. scene luminance) in a low luminance region belowkneepoint, and (ii) a curve for luminance compression that adjusts 99Yto DPB in a high luminance region above kneepoint.

(3) Luminance Compression Ratio<Threshold Value TH_A

In this case, a luminance compression ratio is greater, and thus tonemap generation device 122 creates conversion characteristics forluminance compression on an entire range to maintain the balance of anentire image. In other words, in this case, tone map generation device122 determines to perform the luminance compression on the entire range.

The following focuses especially on a low luminance region below 100 nitin terms of performing the luminance compression on the entire range. Acompression ratio often increases when luminance compression isperformed in a low luminance region in PQ that indicates the absoluteluminance. For this reason, there is a possibility that small valuesrepresenting most of the low luminance region are outputted, and thusthe details of the low luminance region are lost. In order to avoid theabove, tone map generation device 122 creates conversion characteristicsfor performing an individual process on a signal below 100 nit using theparameter of DY100. DY100 indicates a screen occupancy ratio of pixelsincluded in the low luminance region below 100 nit. Accordingly, whentone map generation device 122 performs the luminance compression on theentire range, tone map generation device 122 creates conversioncharacteristics for controlling a compression ratio for luminances lessthan or equal to 100 nit.

In other words, specifically, tone map generation device 122 createsconversion characteristics that vary depending on whether DY100 exceedsthreshold value TH. It should be noted that threshold value TH is athreshold value optimized based on experience.

(3-1) DY100>Threshold Value TH

In this case, since a proportion of pixels having luminances less thanor equal to 100 nit to total pixels is greater than threshold value TH,tone map generation device 122 determines that important details arepresent at the luminances less than or equal to 100 nit, and createsconversion characteristics for no luminance compression in a luminanceregion below 100 nit. In other words, in the case where a luminanceregion having a luminance less than or equal to a first luminance (e.g.KneePoint) is defined as a low luminance region and a luminance regionabove KneePoint is defined as a high luminance region, (i) when DY100 asfirst luminance characteristics exceeds threshold value TH, tone mapgeneration device 122 creates first conversion characteristics thatmaintain luminances in a range below 100 nit as a second luminance, thefirst luminance characteristics being included in dynamic luminancecharacteristics and indicating the number of pixels having theluminances less than or equal to the second luminance among pixelsincluded in the low luminance region in one frame of a video. Tone mapgeneration device 122 creates, for example, conversion characteristicsshown in (c) in FIG. 17 described later.

(3-2) DY100<Threshold Value TH

In this case, since a proportion of pixels having luminances less thanor equal to 100 nit to total pixels is less than threshold value TH,tone map generation device 122 determines that the influence ofluminance compression is small, and creates conversion characteristicsfor the luminance compression also in a low luminance region. In otherwords, when DY100 is less than or equal to threshold value TH, tone mapgeneration device 122 creates second conversion characteristicsdecreasing luminances in a range below 100 nit. Tone map generationdevice 122 creates, for example, conversion characteristics shown in (b)in FIG. 17 described later.

It should be noted that the luminance compression ratio in this case maybe changed according to the value of DY100, and the maximum compressionratio (e.g. 0.8) may be specified and protected. In other words, tonemap generation device 122 may create, as the second conversioncharacteristics, a conversion curve having a slope that is less than 1in the range below 100 nit. Moreover, tone map generation device 122 maycreate, as the second conversion characteristics, a conversion curvethat causes a proportion of the luminances in the range below 100 nit todecrease with a decrease in the value indicated by DY100.

FIG. 14 is a graph for illustrating a method of creating conversioncharacteristics created when DY100<threshold value TH. FIG. 14 is agraph showing a relationship of display luminance (output luminance) toscene luminance (input luminance), that is, an example of the secondconversion characteristics.

As shown in FIG. 14, in a low luminance region, the second conversioncharacteristics are a curve for luminance compression at a compressionratio from 0.8 to 1.0 of a PQ curve in a range up to 100 nit. It shouldbe noted that when DY100<threshold value TH, the compression ratio isless than 1.0. KneePoint is 100 nit. Moreover, in a high luminanceregion, the second conversion characteristics are an inverse gamma basedcurve A×Input^((1/2.2)*B) up to 99Y. Finally, the second conversioncharacteristics are a curve obtained by connecting the curve in the lowluminance region and the curve in the high luminance region so that thecurve has a continuous, more smooth slope.

Next, the following describes the first example of tone mapping withreference to a flowchart.

FIG. 15 is a flowchart for illustrating the first example of tonemapping.

Tone mapping processor 120 performs tone mapping.

In the first example, when the tone mapping is started in step S2 in theabove-described flowchart of FIG. 12, tone map generation device 122obtains dynamic metadata included in video data (S11).

Next, tone map generation device 122 calculates a luminance compressionratio using 99Y included in the dynamic metadata and DPB of videodisplay device 100, and determines whether the calculated luminancecompression ratio exceeds 1 (S12).

When tone map generation device 122 determines that the luminancecompression ratio exceeds 1 (Yes in S12), tone map generation device 122determines that luminance compression is not to be performed, andoutputs conversion characteristics for no luminance compression to HDRsignal converter 121. Subsequently, HDR signal converter 121 performstone mapping A using the conversion characteristics for no luminancecompression, and outputs a video signal obtained as a result of tonemapping A to display 130 (S13).

When tone map generation device 122 determines that the luminancecompression ratio is less than or equal to 1 (No in S12), tone mapgeneration device 122 determines whether the luminance compression ratioexceeds threshold value TH_A (S14).

When tone map generation device 122 determines that the luminancecompression ratio exceeds threshold value TH_A (Yes in S14), tone mapgeneration device 122 determines that the luminance compression is to beperformed in a high luminance region, and outputs the conversioncharacteristics shown in FIG. 13 to HDR signal converter 121.Subsequently, HDR signal converter 121 performs tone mapping B using theconversion characteristics shown in FIG. 13, and outputs a video signalobtained as a result of tone mapping B to display 130 (S15).

When tone map generation device 122 determines that the luminancecompression ratio is less than or equal to threshold value TH_A (No inS14), tone map generation device 122 determines whether DY100 includedin the obtained dynamic metadata exceeds threshold value TH (S16).

When tone map generation device 122 determines that DY100 exceedsthreshold value TH (Yes in S16), tone map generation device 122 outputsconversion characteristics for no luminance compression in a luminanceregion below 100 nit to HDR signal converter 121. Subsequently, HDRsignal converter 121 performs tone mapping C using the outputtedconversion characteristics, and outputs a video signal obtained as aresult of tone mapping C to display 130 (S17).

When tone map generation device 122 determines that DY100 is less thanor equal to threshold value TH (No in S16), tone map generation device122 outputs the conversion characteristics that are shown in FIG. 14 anddecrease luminances in a range below 100 nit, to HDR signal converter121. Subsequently, HDR signal converter 121 performs tone mapping Dusing the outputted conversion characteristics, and outputs a videosignal obtained as a result of tone mapping D to display 130 (S18).

1-10-2. Second Example of Tone Mapping

The second example of tone mapping will be described below. In thesecond example, a method of creating conversion characteristics will bedescribed that is different from the first example in which DY100 and99Y are used.

In this example, tone map generation device 122 calculates the maximumvalue of an input signal according to 99Y, and determines a slope at a100 nit point according to DY100.

FIG. 16 is a graph for illustrating the method of creating conversioncharacteristics in the second example.

In the second example, tone map generation device 122 creates conversioncharacteristics including the origin coordinates, the 100 nitcoordinates (Ks), and the maximum value coordinates (99Y, DPB: DisplayPeak Brightness) shown in FIG. 16. Tone map generation device 122calculates an output luminance (F1s(100)) corresponding to an inputluminance at 100 nit using DY100, and determines the slope of conversioncharacteristics below 100 nit and the slope of conversioncharacteristics at the 100 nit coordinates Ks to be, for example, 1.0.

Next, tone map generation device 122 determines the slope of conversioncharacteristics at 99Y using 99Y, and connects the three points of theorigin coordinates, the 100 nit coordinates Ks, and the maximum valuecoordinates with a spline curve to generate a tone map.

Specifically, as shown in (b) in FIG. 17, when DY100 is less than orequal to a predetermined threshold value, tone map generation device 122may create conversion characteristics of which the slope up to 100 nitcoordinates is decreased by reducing the value of F1s(100). Moreover, inthis case, tone map generation device 122 creates conversioncharacteristics of which the slope in a range above 100 nit is madegreater than the slope in a range below 100 nit by decreasing a slope at100 nit coordinates Ks. As a result, it is possible to ensure the tonesof the range above 100 nit.

Furthermore, at the same time, tone map generation device 122 may createconversion characteristics of which the slope is weighted usingpercentile information. For example, when maxRGB Percentile [75] becomesa value (e.g. a value within a predetermined luminance range having 99Yas a reference) close to 99Y, tone map generation device 122 candetermine that there are many gray scale components in a high luminanceregion (in the vicinity of 99Y). For this reason, tone map generationdevice 122 may create conversion characteristics of which the slopeincreases from maxRGB Percentile [75] to 99Y. As a result, it ispossible to improve the tones between maxRGB Percentile [75] and 99Y.

It should be noted that in FIG. 16 Ks (100, F1(100)) indicates theknee-point of a scene determined by estimating DY100. Further, F1s(Vi)indicates a linear function for a low-level signal (a low luminanceregion) of a scene, such as a luminance range from 0 to 100 nit of avideo input signal. F2s(Vi) indicates a spline curve function for amedium-level signal and a high-level signal (medium and high luminanceregions) of a scene, such as a luminance range from 100 nit to 99Y of avideo input signal.

1-10-3. Third Example of Tone Mapping

The third example of tone mapping will be described below. In the thirdexample, tone mapping in a high luminance region will be described.

FIG. 18 is a diagram for illustrating conversion characteristics createdin the third example of tone mapping.

In this case, tone map generation device 122 creates conversioncharacteristics that represent a luminance range from 18G to 99Y with arange of luminances from 18G to the maximum luminance (Max_Luminance) ofvideo display device 100, using 18G and 99Y of dynamic metadata. Forthis reason, as shown in (b) in FIG. 18, tone map generation device 122creates conversion characteristics for which the weights of tones in thehigh luminance region are increased, using a value of at least one ofmaxRGB Percentile [90] and maxRGB Percentile [98]. In other words, tonemap generation device 122 creates conversion characteristics of whichthe slope in the high luminance region is increased.

FIG. 19 is a graph showing an example of conversion characteristicscreated when both maxRGB Percentile [90] and maxRGB Percentile [98] arecloser to 18G than to 99Y. FIG. 20 is a graph showing an example ofconversion characteristics created when both maxRGB Percentile [90] andmaxRGB Percentile [98] are closer to 99Y than to 18G.

For example, when both maxRGB Percentile [90] and maxRGB Percentile [98]are closer to 18G than to 99Y, tone map generation device 122 setsknee_high_point, the upper limit of a region for setting KneePoint, tomaxRGB Percentile [98]. In consequence, as shown in FIG. 19, tone mapgeneration device 122 creates conversion characteristics for which theweights of tones from 18G to maxRGB Percentile [98] are increased, thatis, of which the slope is close to 1 (e.g. the slope is greater than0.8).

In contrast, when both maxRGB Percentile [90] and maxRGB Percentile [98]are closer to 99Y than to 18G, tone map generation device 122 setsknee_high_point to be in the vicinity of Max_Luminance. In consequence,as shown in FIG. 20, tone map generation device 122 creates conversioncharacteristics for which the weights of tones from knee_high_point to99Y are increased, that is, of which the slope is close to 1 (e.g. theslope is greater than 0.8).

It should be noted that it is possible to generate a tone map weightedwith these values, other than the examples of FIG. 19 and FIG. 20. Tonemap generation device 122 may create conversion characteristics usingdynamic metadata, such as knee_point_x, knee_point_y, and bezier_anchor.

It should be noted that the third example of tone mapping may beperformed for (2) Threshold Value TH_A<Luminance Compression Ratio<1 or(3-1) DY100>Threshold Value TH in (3) Luminance CompressionRatio<Threshold Value TH_A of the first example.

As described above, in tone mapping B or tone mapping C shown in FIG.15, (i) when the third luminance is closer to 18G as the secondluminance than to the maximum luminance (i.e. 99Y) in one frame, tonemap generation device 122 creates conversion characteristics of whichthe slope from the second luminance to the third luminance is greaterthan a slope in a range exceeding the third luminance, the thirdluminance being a luminance when a cumulative value from 0 reaches atleast one of 90% or 98% as the first proportion that is at least 90% oftotal pixels in a histogram of maxRGB of each pixel in the one frame(i.e. at least one of maxRGB Percentile [90] and maxRGB Percentile[98]).

Moreover, in tone mapping B or tone mapping C shown in FIG. 15, (ii)when the third luminance is closer to the maximum luminance than to thesecond luminance, tone map generation device 122 creates conversioncharacteristics of which the slope from the second luminance to thethird luminance is less than a slope in a range exceeding the thirdluminance.

As a result, when at least one of maxRGB Percentile [90] and maxRGBPercentile [98] is a value closer to 18G than to 99Y, it can be saidthat at least 90% of the total pixels is concentrated on the luminancesclose to 18G. For this reason, it is possible to increase the weights oftones from 18G to maxRGB Percentile [98] by performing tone mappingusing conversion characteristics of which the slope from 18G to maxRGBPercentile [98] is made greater than a slope exceeding maxRGB Percentile[98]. Accordingly, it is possible to improve the tones of the pixelshaving the luminances concentrated in one frame, thereby enhancing thevideo quality.

Moreover, when at least one of maxRGB Percentile [90] and maxRGBPercentile [98] is a value closer to 99Y than to 18G, it can be saidthat the remaining 10% or 2% of the total pixels is concentrated in thehigh luminance region up to 99Y. For this reason, it is possible toincrease the weights of tones from maxRGB Percentile [90] to 99Y byperforming tone mapping using conversion characteristics of which theslope from maxRGB Percentile [90] to 99Y is made greater than a slopefrom 18G to maxRGB Percentile [90]. Accordingly, it is possible toimprove the tones of the pixels having the luminances concentrated inone frame, thereby enhancing the video quality.

It should be note that although 18G is illustrated as the secondluminance in the third example, the second luminance may be 100 nit.

1-10-4. Fourth Example of Tone Mapping

The fourth example of tone mapping will be described below. In thefourth example, tone mapping in a low luminance region will bedescribed.

The luminance compression is performed in the low luminance region whenDY100 is less than or equal to threshold value TH_A in the firstexample, but the present disclosure is not limited to this. For example,it may determine whether to perform the luminance compression in the lowluminance region according to a value of maxRGB Percentile [1].

Specifically, tone map generation device 122 determines whether aluminance distribution in the low luminance region (dark part) is broador narrow depending on whether maxRGB Percentile [1] of dynamic metadatahas a luminance higher than predetermined luminance. Then, when tone mapgeneration device 122 determines that the luminance distribution in thedark part is narrow, tone map generation device 122 may createconversion characteristics for compressing a luminance dynamic range byperforming tone mapping on a dark part side, and expanding an entiredynamic range as shown in, for example, (b) in above-described FIG. 17.For example, when maxRGB Percentile [1] has 200 nit, the area of pixelshaving luminances less than or equal to 200 nit accounts for at most 1%of the area of an entire screen. For this reason, an overall influenceis small even when luminance information about the luminances less thanor equal to 200 nit is compressed. Further, at the same time, it ispossible to output a video signal in which the tones of an input signalgreater than or equal to 200 nit are maintained, by causing a slope in aregion above 200 nit to be 1.

1-10-5. Fifth Example of Tone Mapping

The fifth example of tone mapping will be described below. In the fifthexample, a method of determining KneePoint (knee-point) will bedescribed.

Tone map generation device 122 determines KneePoint (a point at whichcompression of content luminance is started) of conversioncharacteristics using 99Y and 18G. As a result, it is possible toappropriately improve the contrast of a specific luminance region.

When, for example, 99Y is higher than DPB of video display device 100,tone map generation device 122 uses 18G to determine KneePoint forconversion characteristics, 99Y serving as a convergent point. Tone mapgeneration device 122 sets the following values for 99Y.

Tone map generation device 122 sets, as kneepoint_max, the upper valueat which conversion characteristics allow for visual identification ofall the tones. Moreover, tone map generation device 122 sets, askneepoint_min, a value at which conversion characteristics allow foruniform, visual identification of all the tones. Here, kneepoint_max isthe upper value of a luminance range for determining KneePoint, andkneepoint_min is the lower value of the luminance range for determiningKneePoint.

Furthermore, since at least 80% of a luminance distribution isconcentrated at a value less than or equal to the value indicated by18G, tone map generation device 122 determines kneepoint using 18G.

FIG. 21 is a flowchart for illustrating the fifth example of tonemapping. FIG. 22 is a diagram for illustrating conversioncharacteristics created in the fifth example of tone mapping.

In the fifth example, when the tone mapping is started in step S2 in theabove-described flowchart of FIG. 12, tone map generation device 122obtains dynamic metadata included in video data (S21).

Next, tone map generation device 122 determines whether 99Y included inthe dynamic metadata is less than or equal to DPB of video displaydevice 100 (S22).

When tone map generation device 122 determines that 99Y is less than orequal to DPB (Yes in S22), tone map generation device 122 sets Knee_end,the convergent point of conversion characteristics on a high luminanceside, to DPB (S23).

When tone map generation device 122 determines that 99Y exceeds DPB (Noin S22), tone map generation device 122 sets Knee_end to 99Y (S24).

Subsequently, tone map generation device 122 determines whether 18G isless than KneePoint_max (S25).

When tone map generation device 122 determines that 18G is greater thanor equal to KneePoint_max (No in S25), tone map generation device 122sets KneePoint to KneePoint_max (S26).

When tone map generation device 122 determines that 18G is less thanKneePoint_max (Yes in S25), tone map generation device 122 determineswhether 18G is less than KneePoint_min (S27).

When tone map generation device 122 determines that 18G is less thanKneePoint_min (Yes in S27), tone map generation device 122 setsKneePoint to KneePoint_min (S28).

When tone map generation device 122 determines that 18G is greater thanor equal to KneePoint (No in S27), tone map generation device 122 setsKneePoint to 18G.

When steps S23, S26, S28, and S29 end, tone map generation device 122terminates the process.

Tone map generation device 122 may create the conversion characteristicsin the above first to fourth examples using KneePoint set in theprocess.

2. Method of Generating Dynamic Metadata

Next, the following describes a method of generating dynamic metadata.

Hereinafter, a method of generating metadata is disclosed that isrequired to solve the conventional problem with a dynamic tone map.

2-1. Configuration of Generation Device

The following describes the configuration of a generation device thatgenerates dynamic metadata.

FIG. 23 is a block diagram illustrating an example of the configurationof the generation device according to the embodiment. FIG. 24 is a blockdiagram illustrating an example of the configuration of a generatoraccording to the embodiment.

Generation device 200 includes video receiver 210, generator 220, andmemory 230.

Video receiver 210 receives a main video that is a video. In otherwords, video receiver 210 serves as an obtainer that obtains the mainvideo. Video receiver 210 outputs the received main video to generator220.

Generator 220 analyzes the main video outputted by video receiver 210 togenerate dynamic metadata indicating luminance characteristics for eachscene. Specifically, generator 220 generates dynamic metadata for eachframe, and temporarily stores into memory 230 the dynamic metadatagenerated from the frames included in one scene. Subsequently, generator220 generates the dynamic metadata for each scene by calculating anaverage value or the maximum value using the dynamic metadata for asmuch as one scene. It should be noted that generator 220 may outputmetadata generated for each frame.

Generator 220 includes video information luminance converter 221,luminance histogram constructor 222, and determiner 223.

Video information luminance converter 221 converts a video signal havingan RGB value, into a luminance signal.

Luminance histogram constructor 222 constructs a luminance histogramfrom signal information obtained by video information luminanceconverter 221 performing luminance conversion.

Determiner 223 determines dynamic metadata for each frame using theluminance histogram constructed by luminance histogram constructor 222.Moreover, determiner 223 merges temporally similar video informationusing the dynamic data generated from the frames included in one scenetemporarily stored in memory 230. Here, the term “merge” meanscalculating the maximum value in a scene (similar frame) at 99Y and anaverage value in a scene (similar frame) at 100DY.

Memory 230 temporarily stores the dynamic metadata for each framegenerated by generator 220.

2-2. Operations of Generation Device

Next, the following describes a method of generating dynamic metadata bya generation device.

FIG. 25 is a flowchart showing an example of a generation method.

First, in generation device 200, video receiver 210 obtains a main video(S31).

Next, generator 220 starts a loop for repeating step S32 and step S33for each of frames constituting the main video obtained by videoreceiver 210.

Generator 220 determines luminance characteristics for a frame to beprocessed (S32). The details of the step of determining luminancecharacteristics will be described with reference to FIG. 26.

FIG. 26 is a flowchart showing the step of determining luminancecharacteristics in the generation method.

Generator 220 analyzes the luminances of all pixels included in theframe to be processed, and constructs a luminance histogram (S41).

Next, generator 220 starts counting an integrated value in order fromlow luminance in the luminance histogram (S42). Specifically, generator220 counts a pixel having a set luminance while sequentially increasingluminance from 0 nit by 1 nit in the luminance histogram.

Next, generator 220 determines whether a luminance value to be countedis 100 nit (S43).

When generator 220 determines that the luminance to be counted is 100nit (Yes in S43), generator 220 determines, as DY100, a value obtainedby dividing the integrated value counted thus far by the total number ofpixels (S44). In other words, generator 220 determines, as DY100 that isfirst luminance characteristics, a value obtained by dividing, for eachof the frames constituting the video, the number of pixels having atmost 100 nit as a predetermined luminance among pixels included in theframe by the total number of the pixels included in the frame.

After step S44 or when generator 220 determines that the luminance to becounted is not 100 nit (No in S43), generator 220 determines whether thevalue obtained by dividing the current integrated value by the totalnumber of the pixels exceeds 99.99% (S45).

When generator 220 determines that the value obtained by dividing thecurrent integrated value by the total number of the pixels exceeds99.99% (Yes in S45), generator 220 determines the luminance to becounted as 99Y (S46). In other words, here, generator 220 identifies,for each of the frames constituting the video, the maximum luminancethat is the luminance at 99.99% of all the pixels when all the pixelsincluded in the frame are arranged in order of increasing luminance, anddetermines the identified maximum luminance as 99Y that is secondluminance characteristics.

After step S46 or when generator 220 determines that the value obtainedby dividing the current integrated value by the total number of thepixels is less than or equal to 99.99% (No in S45), generator 220determines whether 100DY and 99Y are already determined (S47).

When generator 220 determines that 100DY and 99Y are already determined(Yes in S47), generator 220 ends the step of determining luminancecharacteristics.

When generator 220 determines that 100DY and 99Y are not alreadydetermined (No in S47), generator 220 increases the luminance to becounted by 1 nit and returns to step S43.

Returning to FIG. 25, when generator 220 ends the step of determiningluminance characteristics, generator 220 outputs the determinedluminance characteristics to memory 230 (S33). Next, when luminancecharacteristics for as much as one scene are accumulated in memory 230,generator 220 performs merging using the luminance characteristics foras much as one scene, outputs dynamic metadata resulting from themerging, and ends the process.

It should be noted generation device 200 records the outputted dynamicmetadata in the supplemental enhancement information (SEI) of content.In other words, generation device 200 may record the dynamic metadatatogether with the video onto a recording medium, such as HDD, SSD, andBD.

It should be noted that although the example has been described in whichDY100 and 99Y included in dynamic metadata are generated as the dynamicmetadata generated by generation device 200, other dynamic metadata maybe generated in the same manner.

As described above, generation device 200 can analyze the video togenerate the dynamic metadata. Accordingly, because the video displaydevice obtains, in addition to the video, the dynamic metadataindicating the dynamic luminance characteristics of the video, the videodisplay device can perform tone mapping according to the luminancecharacteristics of the video indicated by the dynamic metadata. In otherwords, the video display device can perform dynamic tone mapping withoutanalyzing a video, and reduce processing load.

Further, because the video display device can reduce a processing timefor analyzing a video, the video display device can effectively performdynamic tone mapping on the video.

3. Variations

A main video is, for example, an HDR video. The HDR video may be a videoon, for example, a Blu-ray disc, a DVD, a video distribution site on theInternet, a broadcast, or an HDD (Hard Disk Drive).

The video reproduction device may be a device that decodes compressedvideo signals from a recording medium, a broadcast, or the Internet, andtransmits the decoded video signals to the video display device.Examples of the device include a disc player, a disc recorder, a set topbox, a television set, a personal computer, and a smartphone. Part orall of the functions of the video reproduction device may be included invideo display device 100.

A video signal transmitting means that transmits video signals from thevideo reproduction device to the video display device may be a meansthat transmits video signals in uncompressed form, such as HDMI(registered trademark), DVI, or DP, and maybe a means that transmitsvideo signals in compressed form, such as transmission via a network.

The maximum luminance information or tone mapping information of thevideo display device may be set in the video reproduction device by auser providing input to the video reproduction device using a remotecontrol or an operating portion of the video reproduction device.Alternatively, the user may obtain such information using the Internetor another means, store the obtained information in a portable storagemedium, and transmit the information to the video reproduction devicevia the portable storage medium. Moreover, the video reproduction devicemay be directly connected to the Internet, and the video reproductiondevice may obtain such information from the database of a server.Furthermore, the video reproduction device may display a test pattern onthe video display device, and obtain and store the information whilechecking the characteristics of the video display device using thedisplayed test pattern.

Although the video display method and luminance characteristicsgeneration method according to the embodiment of the present disclosurehave been described above, the present disclosure is not limited to theembodiment.

Moreover, each of processing units included in the video display deviceand the generation device according to the embodiment is typicallyimplemented as LSI (large scale integration) that is an integratedcircuit. These may be implemented in a single chip individually, or in asingle chip that includes some or all of them.

Moreover, the method of circuit integration is not limited to LSI.Integration may be implemented with a dedicated circuit or ageneral-purpose processor. A field programmable gate array (FPGA) thatcan be programmed after manufacturing LSI or a reconfigurable processorthat allows reconfiguration of the connections and settings of circuitcells inside the LSI may be used.

Moreover, in the embodiment, the structural components may be eachconfigured using dedicated hardware or may be each realized by executinga software program suitable for the structural component. Each of thestructural components may be implemented by a program executing unit,such as a CPU or a processor, reading and executing a software programrecorded on a recording medium, such as a hard disk or a semiconductormemory.

Moreover, the present disclosure may be realized as various methodexecuted by the video display device and the generation device.

Moreover, the division of the functional blocks in the block diagram isone example, and functional blocks may be achieved as one functionalblock, one functional block may be divided into blocks, or some offunctions may be transferred to other functional blocks. Further, singlehardware or software may process similar functions of functional blocks,in parallel or by time division.

Moreover, the order in which the steps are performed in the flow chartis one example for specifically describing the present disclosure, andorder other than the above order may be used. Further, some of the stepsmay be performed simultaneously (in parallel) with other steps.

Although the video display device and generation device according to oneor more aspects have been described according to the embodiment, thepresent disclosure is not limited to the embodiment. Forms obtained byvarious modifications to the embodiment that can be conceived by aperson skilled in the art as well as forms realized by optionallycombining structural components and functions in the embodiment whichare within the scope of the essence of the present invention areincluded in the present invention.

INDUSTRIAL APPLICABILITY

The present disclosure is applicable to video display devices andgeneration devices.

REFERENCE MARKS IN THE DRAWINGS

-   100 video display device-   110 video receiver-   120 tone mapping processor-   121 HDR signal converter-   122 tone map generation device-   123 input signal-luminance converter circuit-   124 luminance-output level converter circuit-   130 display-   200 generation device-   210 video receiver-   220 generator-   221 video information luminance converter-   222 luminance histogram constructor-   223 determiner-   230 memory

1. A video display device, comprising: an obtainer that obtains videodata including a video and dynamic luminance characteristics indicatinga time-dependent change in luminance characteristics of the video; atone mapping processor that, in the case where a luminance region havinga luminance less than or equal to a first luminance is defined as a lowluminance region, and a luminance region having a luminance exceedingthe first luminance is defined as a high luminance region, (i) performsfirst tone mapping using first conversion characteristics when firstluminance characteristics exceed a predetermined threshold value, and(ii) performs second tone mapping using second conversioncharacteristics when the first luminance characteristics are less thanor equal to the predetermined threshold value, the first luminancecharacteristics being included in the dynamic luminance characteristicsand indicating the number of pixels having luminances less than or equalto a second luminance among pixels included in the low luminance regionin one frame of the video, the first tone mapping maintaining theluminances less than or equal to the second luminance, the second tonemapping decreasing the luminances less than or equal to the secondluminance; and a display that displays a video obtained as a result ofthe first tone mapping or the second tone mapping.
 2. The video displaydevice according to claim 1, wherein the tone mapping processor uses, asthe second conversion characteristics for use in the second tonemapping, a conversion curve having a slope that is less than 1 at theluminances less than or equal to the second luminance.
 3. The videodisplay device according to claim 1, wherein the tone mapping processoruses, as the second conversion characteristics for use in the secondtone mapping, a conversion curve that causes a proportion of theluminances less than or equal to the second luminance to the secondluminance to decrease with a decrease in a value indicated by the firstluminance characteristics.
 4. The video display device according toclaim 1, wherein in the first tone mapping, the tone mapping processoruses: (i) a conversion curve as the first conversion characteristicswhen a third luminance is closer to the second luminance than to amaximum luminance in the one frame, the third luminance being aluminance when a cumulative value from 0 reaches a value of a firstproportion that is at least 90% of a total number of pixels in ahistogram of maxRGB values of each pixel included in the one frame, theconversion curve having a slope from the second luminance to the thirdluminance greater than a slope in a range of luminances exceeding thethird luminance; and (ii) a conversion curve as the first conversioncharacteristics when the third luminance is closer to the maximumluminance than to the second luminance, the conversion curve having theslope from the second luminance to the third luminance less than theslope in the range of luminances exceeding the third luminance.
 5. Thevideo display device according to claim 1, wherein the tone mappingprocessor uses, as the first conversion characteristics for use in thefirst tone mapping: a conversion curve having a minimum value of aluminance range as a knee-point when the second luminance is greaterthan a maximum value of the luminance range, the luminance range beingpreset to determine the knee-point; a conversion curve having the secondluminance as a knee-point when the second luminance is located in theluminance range; and a conversion curve having the maximum value of theluminance range as a knee-point when the second luminance is less thanthe minimum value of the luminance range.
 6. The video display deviceaccording to claim 1, wherein the second luminance is 100 nit.
 7. Avideo display method for use in a video display device, the videodisplay method comprising: obtaining video data including a video anddynamic luminance characteristics indicating a time-dependent change inluminance characteristics of the video; in the case where a luminanceregion having a luminance less than or equal to a first luminance isdefined as a low luminance region, and a luminance region having aluminance exceeding the first luminance is defined as a high luminanceregion, (i) performing first tone mapping using first conversioncharacteristics when first luminance characteristics exceed apredetermined threshold value, and (ii) performing second tone mappingusing second conversion characteristics when the first luminancecharacteristics are less than or equal to the predetermined thresholdvalue, the first luminance characteristics being included in the dynamicluminance characteristics and indicating the number of pixels havingluminances less than or equal to a second luminance among pixelsincluded in the low luminance region in one frame of the video, thefirst tone mapping maintaining the luminances less than or equal to thesecond luminance, the second tone mapping decreasing the luminances lessthan or equal to the second luminance; and displaying a video obtainedas a result of the first tone mapping or the second tone mapping.